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5 RANDOMLY GENERATED GLYCOPEPTIDE COMBINATORIAL LIBRARIES 

BACKGROUND OF THE INVENTION 

The present invention relates to a method for 

10 generating a combinatorial library of glycopeptides , and to 
glycopeptide libraries produced by the method. More 
particularly, it relates to a method for generating a 
combinatorial library of a cancer-associated mucin. 

Glycopeptides are a broad class of organic compounds 

15 that are important in diverse biochemical processes, 
including cell growth regulation, binding of pathogens to 
cells, intercellular communication, and metastasis. 
Biosynthetically, the carbohydrates of glycoproteins are 
attached (co- or posttranslat ionally) by glycosyl transf er- 

20 ase enzymes, each enzyme being specific for a particular 
monosaccharide unit and linkage type. Mono- and oligo- 
saccharides are transferred to proteins in the endoplasmic 
reticulum and linked to the NHj group on the side chain of 
an asparagine residue of the protein, to form N-linked 

25 oligosaccharides. Mono- and oligosaccharides also may be 
linked to the OH group on the side chain of a serine, 
threonine, or hydroxylysine residue, to form 0-1 inked 
oligosaccharides . 

Different glycoforms of the same protein are regularly 

30 found, which differ in the carbohydrate structures that are 
attached. The glycoforms vary in properties such as 
protease stability, affinity to receptors, and pharmaco- 
kinetic profile. To study the influence of glycosylation 
patterns on the properties of a glycopeptide, and to 

3 5 identify compounds useful in diagnosis and/or therapy, it 
is necessary to have access to several different glyco- 
forms. The most generally applied approach for obtaining 
defined glycopeptide fragments is chemical synthesis using 
glycosylated amino acids, although enzymatic glycosylation 

40 using glycosyltransf erases has also been used. 



Chemical synthesis of serine and threonine with large 
0-linked carbohydrate structures as building blocks for 
glycopeptide synthesis is much more difficult than 
synthesis of either oligopeptides or oligosaccharides. For 
example, a number of mono-, di- and trisaccharides have 
been identified on the core of the carcinoma-associated 
MUCl . Chemical synthesis of the a-O-linked N-acetyl- 
galactosamine-based mucin-type glycopeptides depends on the 
accessibility to large amounts of O-glycosylated amino 
acids. Sequential glycosylations of serine and threonine 
for Fmoc -based glycopeptide synthesis involves a complex 
manipulation of the select ivities of base- sensitive 
protecting groups. Fmoc-protected serine and threonine are 
among the more highly sensitive and hindered aglycons used 
in glycosylation reactions. It is virtually impossible to 
synthesize chemically all possible combinations of glyco- 
peptides and test them against a natural ant i -mucin 
antibody or polyclonal serum. 

Combinatorial methods have gained great interest as a 
method of finding desirable compounds. These methods 
involve creating libraries of related compounds. Inter- 
action of an antigen with the library is then measured, in 
order to assess whether one or more compounds in the 
library recognizes the antigen. The use of combinatorial 
chemistry to synthesize large numbers of molecules, either 
as individual compounds or as mixtures, is considered to be 
one of the frontiers of organic chemistry applied to drug 
discovery . 

To date, combinatorial chemistry has been applied to 
the generation of peptide and oligonucleotide libraries, 
where the well established chemistry of amide bond and 
phosphodiester bond formation led to rapid progress. 
Combinatorial chemistry has not been applied, however, to 
the generation of glycopeptide libraries. Glycosylation of 
the amino group on the side chain of an asparagine residue 
of the protein, to form N-linked oligosaccharides, may 
occur at any one of the three or four hydroxyl groups on 



the saccharide. For both N-linked and 0-linked oligo- 
saccharides, a new stereocenter is formed on glycosylat ion . 
This results in either an a- or ((3-glycosidic linkage, which 
are usually axial and equatorial, respectively. 

A need therefore exists for a method of generating a 
combinatorial library of glycopept ides . Such a library 
could be used to screen for biological activity of 
different glycoforms within the library. 

SX3MMARY OF THE INVENTION 

It is therefore an object of the present invention to 
provide a combinatorial method for the generation of 
glycopeptide libraries. 

It is a further object of the invention to provide a 
random library of glycopeptides . 

It is another object of the invention to provide a 
method of identifying glycopeptides that have a defined 
biological activity by screening a random library of 
glycopeptides for the biological activity. 

It is yet another object of the invention to provide 
a method of generating a combinatorial library of mucins. 

It is another object of the invention to provide a 
method of generating a combinatorial library of MUCl 
glycopeptides . 

These and other objects of the invention are achieved 
by a method of generating a glycopeptide library, com- 
prising (a) randomly glycosylating a platform having at 
least one glycosylation site with at least one glycosyl 
donor, optionally blocking unreacted glycosylation sites on 
the glycosylated platforms and optionally selectively 
removing one or more protecting groups on the carbohydrate 
groups introduced at the first level; whereby a first level 
library of glycosylated platforms is created; and then (b) 
optionally randomly glycosylating said first level library 
of glycosylated platforms, or a combination of first level 
libraries of glycosylated platforms, with at least one 
glycosyl donor, and optionally selectively removing one or 



more designated protecting groups on the carbohydrate 
groups introduced at the second level; whereby a second 
level library of glycosylated platforms is created. The 
method may further comprise randomly glycosylating the 
second level library of glycosylated platforms, or a 
combination of second level or first and second level 
libraries of glycosylated platforms, with at least one 
glycosyl donor, and optionally selectively removing one or 
more designated protecting groups on the carbohydrate 
groups introduced at the third level; whereby a third level 
library of glycosylated platforms is created; and 
optionally repeating the foregoing step to produce fourth 
and higher level libraries of increased diversity. 

In a preferred embodiment, the randomly-generated 
glycopeptide library comprises carcinoma-associated mucins 
or structures associated with adhesion ligands for 
bacterial receptors that are expressed on human cell 
surface antigens. Components of the library can be 
screened for drug-like, competitive inhibitory, 
immunostimulatory or antibody-like activity. 

Other objects, features and advantages of the present 
invention will become apparent from the following detailed 
description. It should be understood, however, that the 
detailed description and the specific examples, while 
indicating preferred embodiments of the invention, are 
given by way of illustration only, since various changes 
and modifications within the spirit and scope of the 
invention will become apparent to those skilled in the art 
from this detailed description. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a linear platform for combinatorial 
library synthesis according to the present invention. 

Figure 2 is an example of a cyclic platform for 
combinatorial library synthesis according to the present 
invention . 



Figure 3 shows a simple cyclic peptide and solubilized 
version which contains a lipid chain. 

Figure 4 shows a platform with unnatural glycosylation 
sites . 

Figure 5 is an example of a hybrid platform which does 
not include peptide linkages. 

Figure 6 is an example of a cyclic peptide for random 
glycosylations, in which solubility is enhanced by 
introduction of hydrophobic groups. 

Figure 7 shows carbohydrate structures found on cancer 
mucins . 

Figure 8 is a bar graph of the results of screening of 
a GSTA library, 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In accordance with the present invention, it is 
possible to generate large glycopeptide libraries by random 
glycosylation of a selected peptide or peptide-like 
structure that has at least one glycosylation site. Random 
glycosylation of a peptide or peptide-like structure with 
more than one glycosylation site yields a large library of 
all possible combinations of glycosylation since each site 
of glycosylation is unique for a given sequence. The 
random libraries provide a fast and efficient way of 
screening for drug-like, competitive inhibitory, immuno- 
stimulatory, antibody- like and other biological activities. 
The size of the random library of glycopeptides depends on 
the extent of glycosylation in terms of numiber of sites and 
the variety of carbohydrate structures that are added. 

The "platform" or core used as the basis for the 
glycopeptide library is any peptide or peptide-like 
structure, including unnatural, synthetic structures, and 
may contain tandem repeats. The platform includes one or 
more glycosylation sites, which may be natural or 
unnatural. A platform with unnatural glycosylation sites 
is shown in Figure 4. Preferably there are from 2 to 5 
glycosylation sites on the peptide or tandem repeat, and 



more preferably 2 or 3 glycosylation sites. In a preferred 
embodiment, each glycosylation site on a platform is unique 
and distinguishable from other sites due to distinct 
structural features in the vicinity of the site. 

Where the platform is a peptide-like structure, it 
does not necessarily contain peptide linkages, but may 
comprise any structure with glycosylation sites to which 
carbohydrate structures may be attached. For example, a 
platform may be a "hybrid" platform comprising a non- 
peptide polymer or even a chain of carbon atoms to which 
natural amino acid side chains with natural glycosylation 
sites are attached. An example of a hybrid platform in 
shown in Figure 5. 

The glycosylation sites provide hydroxy functions for 
0-glycosylation and/or natural carboxy or carboxamido 
functional groups for N-glycosylation . Preferred 
glycosylation sites include one or more of serine, 
threonine and hydroxylysine , the hydroxyl group of which 
provides O-glycosylation sites, or asparagine, the amino 
group of which provides an N-glycosylation site. 

Glycosylation sites my be of either d- or 1-optical 
configuration, although it is preferable that the 
glycosylation sites consist entirely of d-optical 
configuration. It is more particularly preferred that the 
entire platform be constructed of d-amino acids. 

The platform may be linear (Figure 1) or cyclic 
(Figure 2) , and may carry UV-active or fluorescent labels 
to aid in detection during a process of screening a 
glycopeptide library produced using the platform. 
Hydrophobic amino acids preferably are incorporated in the 
platform, as shown in Figure 6, in order to increase the 
solubility of the platform in the organic solvents used to 
promote glycosylation reactions. In a preferred 

embodiment, glycosylation sites are spaced, singly or in 
clusters, between sequences that include hydrophobic amino 
acids such as alanine, phenylalanine, valine, leucine and 
isoleucine or unnatural hydrophobic amino acids. Lipid 



chains also can be incorporated into the platform to aid in 
the coating of microtiter plates, as shown in Figure 3. 

The peptide of the platform may be isolated or 
synthesized by any known method. The peptide may have free 
groups at the glycosylation binding sites, or it may be 
partially blocked, so that only certain glycosylation sites 
are available. The blocking groups may be introduced 
selectively during the synthesis of the platform, or may be 
introduced at any time during a series of glycosylation 
reactions on the platform. The blocking sites also may be 
selectively removed during any step of then process. 

Carbohydrate structures are randomly introduced or 
arranged on the platform to generate a diverse glycopeptide 
library. Carbohydrate structures may include or be derived 
from glycoproteins, proteoglycans and glycol ipids found on 
both normal and malignant cells. Carbohydrate structures 
may be pre-designed and restricted in number for more 
efficient screening of the resulting combinatorial library 
for specific purposes. Carbohydrate structures may be 
unnatural, i.e., a random combination of monosaccharides 
linked together with a mix of a- and ^-linkages. 

Glycosyl (carbohydrate) donors are selected based on 
knowledge of the carbohydrate moieties contained in a 
glycopeptide of interest. One preferred group of 

carbohydrate structures includes those structures known to 
be adhesion ligands for bacterial receptors that are 
expressed on human cell surface antigens . Another 
preferred group of carbohydrate structures include those 
known to be associated with malignant cell antigens. 

For example, N-acetylgalactosamine O-linked to serine 
or threonine is known as carcinoma-associated Tn antigen, 
while the disaccharide SGal (1-3 ) aGalNAc O-linked to serine 
or threonine in mucins is commonly known as carcinoma- 
associated Thomsen-Friedenreich (TF) antigen. The 
disaccharide is also found on the cell surface of chronic 
and acute myelogenous leukemic leukocytes. Sialylated 
versions of TF, particularly q;2-6-TF, have been found on 



the mucins expressed by human breast cancer cell lines. 
The structure and synthesis of TF family glycosyl donors is 
described by Qiu et al . , Tetrahedron Letters, 37:595-585 
(1996) . (The contents of this document, and all other 
documents specifically cited herein, are incorporated in 
this disclosure in their entirety by reference.) Examples 
of carbohydrate structures found on cancer mucins are found 
in Figure 7. 

Thus, when the desired glycopeptides are carcinoma- 
associated mucins, glycosyl donors used to glycosylate the 
core peptide include galactosamine , N-acetylgalactosamine , 
and sialyl. When generating a combinatorial library for 
glycopeptides other than Tn and TF glycopeptides, other 
glycosyl donors are employed. 

The number of glycosyl donors at each level preferably 
is at least 2, and more preferably at least 3. Typically, 
more donors are used at levels beyond first level than are 
used in generating the first level library. It may be 
preferred in some instances to limit the size of a library 
by limiting the number of donors at each level to less than 
5. Synthesis of glycosyl donors is well known in the art. 

Glycosyl donors can be designed, in accordance with 
the present invention, to yield only particular carbo- 
hydrate structures of interest. In this regard, a glycosyl 
donor may be designed with protecting groups, such as 
4 , 6-benzylidene, to favor formation of a particular 
carbohydrate structure. See, e.g., Qiu et al . , supra, and 
Yule et al.. Tetrahedron Letters, 36:6839-6842 (1995), and 
Broddefalk et al . , Tetrahedron Letters, 17:3011-3014 
(1996) . The glycosyl donors, as well as glycosylat ion 
sequence, may be selected based on a predetermined 
assessment of the nature and number of established 
carbohydrate structures for a selected glycopept ide . 

Glycosyl donors are reacted with a core peptide 
according to methods well known in the art. See, e.g., Qiu 
et al. and Yule et al . , supra, as well as Kunz, Angew. 
Chem. Int. Ed. Engl. 26:294-308 (1987) and Garg et al . , 



Advances in Carbohydrate Chemistry and Biochemistry, 
50:277-310 (1994). A first level library of a desired 
glycopeptide is created by primary glycosylation of the 
peptide with a single glycosyl donor or a mixture of 
donors. Reaction of a core peptide with a glycosyl donor, 
or mixture of donors, results in a library of randomly 
glycosylated glycopept ides . A first level library can be 
used in a screening process, as described below, or can 
form the basis for generating higher level libraries. 

A second level library is created by reacting one or 
more first level libraries with one or more further 
glycosyl donors. Prior to further reaction, unreacted 
glycosylation sites on the peptides may be blocked, e.g., 
by acetylation, in order to prevent these glycoforms from 
being eliminated from the library by being converted into 
different glycoforms. Following purification, the 

protecting groups of the carbohydrate structures on the 
glycoforms are selectively removed to create additional 
glycosylation sites on the existing carbohydrate 
structures. Random glycosylation with these additional 
donors further extends existing carbohydrate structures, 
thereby to create more complex glycopeptide structures . 
Higher level libraries are similarly created by reacting 
one or more second level or higher libraries with one or 
more further glycosyl donors . 

The total number of possible glycoforms contained in 
a combinatorial library of glycopept ides according to the 
invention can be calculated by the following formula: 

# of glycoforms = (x + 1)" 
where x is the number of glycosyl donors used and n is the 
number of glycosylation sites. For example, the following 
table shows the number of glycoforms that are obtained when 
up to five glycosyl donors and up to five glycosylation 
sites are used: 



Table 1 



# of 
sites 


carbohydrate structures (x) 


1 


2 


3 


4 


5 


1 


2 


3 








2 


4 


9 


16 


25 


36 


3 


8 


27 


64 


125 


216 


4 


16 


81 


256 


625 


1296 


5 


32 


243 


1024 


3125 


7776 



It is possible to selectively control the size and 
complexity of the glycopeptide libraries in a number of 
ways. The size and complexity of a random library of 
glycopeptides depend on the extent of glycosylation, which 
in turn depends on the number of glycosylation sites on the 
core peptide and on the number of glycosyl units that are 
added. Libraries with various combinations of glycoforms 
can be achieved by mixing lower level libraries before 
glycosylation, by mixing donors at each level, and by using 
mixed lower level libraries in combination with mixed 
donors. Sites of further glycosylation can be controlled 
by protecting unreacted glycosylation sites on the peptide, 
thereby preserving those structures in the library. 

The glycopeptide libraries can be used to screen for 
biologically active compounds. By screening a number of 
libraries, each with different combinations of carbohydrate 
structures on the core peptide, it is possible to identify 
which structures have drug-like, competitive inhibitory, 
immunostimulatory, antibody- like, and many other biological 
activities. A library according to the present invention 
can be screened for compounds that have ant i -bacterial 
activity, including compounds that competitively inhibit 
bacterial adhesion to a host cell. The library also may be 



screened for compounds that have anti-viral activity. In 
a preferred embodiment libraries are generated and screened 
to develop highly sensitive diagnostic antibodies and 
antigens for the detection and immunotherapy of cancers. 

Methods of screening for biological activities using 
combinatorial libraries are known in the art. U.S. Patent 
Nos . 5,510,240 and 5,541,061 provide examples of methods of 
screening comibinatorial libraries, but any suitable method 
of screening for biological activity using combinatorial 
libraries may be used in accordance with the present 
invention. For example, an ELISA of various antibodies 
binding either Ovine Submaxillary Mucin (OSM) or CA27.29 as 
the solid phase may be used in an inhibition format with 
components from the libraries as inhibitors. 

In a preferred embodiment, the peptide is derived from 
a cancer-associated mucin, and is in particular a MUCl core 
protein. The MUCl tandem repeat derived sequence 

GVTSAPDTRPAPGSTA, contains five 0-glycosylation sites, two 
serines and three threonines, and is an example of a 
peptide that can be glycosylated according to the present 
invention to create a glycopeptide library. If all 
possible glycosylation sites in a tandem repeat are used 
only once in primary glycosylation with N-acetylgalactos- 
amine (Tn antigen) , five different monoglycosylated tandem 
repeats result, but if glycosylation is randomized between 
0 and 5 sites, there are 32 different combinations of 
glycosylated tandem repeats. If 0 to 5 sialic acids are 
then randomly added at the 6 -position of the existing 
N-acetylgalactosamines, the possible number of glycoforms 
increases to 243. These will carry only combinations and 
varied numbers of Tn and STn. If another donor is added at 
each glycosylation, e.g., TF along with the first and 
GlcNAc along with the second, a total of 16807 glycosyla- 
tion variants of MUCl tandem repeat will be produced. This 
library will constitute more than 90% of all truncated 
versions (core structures) that may be associated with 



cancerous MUCl mucin. These are useful as vaccine 
components . 

A simpler tetrapept ide , GSTA, has only two 
O-glycosylation sites and is useful in demonstrating the 
method according to the invention, as shown in the 
following examples. The protecting groups on the glycosyl 
donors and the sequence of glycosylat ion reactions are 
designed to yield only the established carbohydrate 
structures possible in mucin biosynthesis. The "rules" 
observed by the glycosyl transferases in creating 
carbohydrate structures on mucins are shown in Table 2, 
wherein GalNAc is N- acetylgalactosamine , GlcNAc is N- 
acetylglucosamine, and Gal is galactose. Sialic acid is 
another name for N-acetylneuraminic acid. It is notable 
that the N-acetylgalactosamine-based glycosyl donors form 
only alpha linkages with serine and threonine hydroxyls. 

The library of GSTA glycopeptides modelled on 
naturally-existing mucins, is small enough that the 
components can be characterized by mass spectrometry. It 
is therefore very useful in gaining a precise understanding 
of glycosylat ion patterns of the MUCl core protein, which 
is necessary in order to design effective therapeutic 
vaccines and diagnostic tools. 



Table 2 . 



Donor 


Linkage 


Site 


Example 


GalNAc 


alpha 


serine , 
threonine 
and 3-0 
GalNac 


Tn, Core 6 


GlcNAc 


beta 


6-0 GalNac, 
3-0 Gal 


Fl alpha 


Sialic acid 


alpha 


6-0 GalNAc, 
3-0 Gal 


STn, STF 


Gal 


beta 


3- 0 GalNAc, 

4- 0 GlcNAc 


TF, Fl alpha 



The following examples illustrate generation of a 
10 library of GSTA glycopept ides according to the present 
invention, but do not limit the scope of the invention in 
any way. Further aspects and variations of the invention, 
based on the disclosure above and the following examples, 
will be apparent to the person of ordinary skill in the 
15 art. 

EXAMPLES 



Example 1 : Synthesis of protected peptides of GSTA 

GSTA is a four amino acid residue of MUCl, which has 

20 two unique sites for glycosylation, the serine residue (S) 
and the threonine residue (T) . It is manually synthesized 
in solution with N-terminal Fmoc and C-terminal benzyl, 
with serine and threonine hydroxyls free. 

Glycosyl donors N-acetylgalactosamine (Tn antigen) 

25 and SGal (1-3 ) CkiGalNAc (carcinoma associated Thomsen- 
Friedenreich, or TF antigen) are protected with 4,6- 
benzylidenyl protecting groups. Synthesis of protected 
peptides of GSTA is shown in Reaction Scheme I . 

- 13 - 



Preparation of compound 1 

A mixture of L-alanine, 20 g, 0.22 mol and toluene-4- 
sulphonic acid, 53 g, 0.28 mol in 100 mL of benzyl alcohol 
and 150 ml of benzene were refluxed overnight using a Dean 
stark apparatus. Benzene was removed in vacuo and 250 ml 
of ether was added to give compound 1, a solid, 75 g, 78%. 

[a]D-5.2 (cO.25, MeOH) ; 'H-NMR (300 MHz, CDC13), 8=8.25 (m, 
3 H, TSOH, 2 NHj) , 7.00-7.70 (m, 9 H, Ar-H) , 5.08, 4.98 (2 
d, 2 H, J=12.5 Hz, CHi) , 4.05 (dd, I H, J=7.5, 15.0 Hz, Ala- 

(x-H) , 2.30 (s, 3 H, PhCHj) , 1.43 (d, 3 H, J=7 . 0 Hz, CH3) . 

Preparation of compound 3 

To a solution of N-Boc-L-threonine 2, 10 g, 45.6 mmol in 80 
mL of dry THF was added 3.5 mL of N-methyl morpholine and 
4.5 mL of i -butyl chlorof ormate at -20 °C under nitrogen. 
After stirring at -20°C for 5 min., a solution of compound 
1, 15 g, 57 mmol and 3.5 ml of N-methyl morpholine in 40 ml 
of dry THF and 5 ml of dry DMF was added. After stirring 
at -20 °C for 3 0 min. , 10 mL of methanol was added and the 
solvent was removed in vacuo. The residue was purified by 
silica gel column using ethyl acetate/hexane (1:1) to give 
compound 3, a white powder after freeze drying from 
dioxane, 12.5 g, 71%. [a] D -39.2 {cO.25, MeOH); ^H-NMR (300 
MHz, CDCI3) , 5 = 7.30-7.50 (m, 5H, Ar-H), 7.10 (d, IH, J=7 . 0 
Hz, NH) , 5.56 (d, 1 H, J=8 . 0 Hz, NH) , 5.20, 5.14 (2 d, 2 H, 
J= 12.0 Hz, CHjPh) , 4.60 (m, 1 H, Ala-a-H) , 4.30 (m, 1 H, 
Thr-jS-H) , 4.14 (dd, IH, J=2 . 0 , 7.5 Hz, Thr- (x-H) , 3.46 (m, 
1 H, OH), 1.45 (s, 9 H, 3 CH3) , 1.41 (d, 3 H, J=7 . 5 Hz, 
CH3) , 1.18 (d, 3 H, J=6.5 Hz, CH3) . 

Preparation of compound 4 

A solution of compound 3, 8.7 g, in 50 mL of formic acid 
was stirred at room temperature for 5 hours. Formic acid 
was removed in vacuo and ethyl acetate was added to the 
residue to give compound 4, a solid, 5.3 g, 71%. 
[a] D-66 . 8 (cO. 25 , MeOH); ^H-NMR (300 MHz, CDCI3+CD3OD) , 5 = 8.42 
(s, 1 H, HCOOH) , 7.30-7.50 (m, 5 H, Ar-H), 7.10 (d, 1 H, 



J=7.0 Hz, NH) , 5.56 (d, I H, J=8 . 0 Hz, NH) , 5.17, 5.09 (2 
d, 2 H, J=12.0 Hz, CH2Ph) , 4.50 (dd, 1 H, J=7.5, 15.0 Hz, 
Ala- (a-H) , 3.89 (m, 1 H, Thr-a-H) , 3.48 (d, 1 H, J=7 . 5 Hz, 
Thr- (X-H) , 1.41(d, 3 H, J=7 . 5 Hz, CH3),1.23 (d, 3 H, J=6 . 5 
Hz, CH3) ; ES-MS. Calc. for C14H20O4N2, 280.3 ; Found, 279.4. 

Preparation of compound 5 

A solution of N-Fmoc-L-glycine, 100 g, 0.336 mol and DCC, 
139 g, 0.67 mol in 1000 mL of dry ethyl acetate was stirred 
at 5°C for 15 min. A solution of N-hydroxysuccinimide , 
13 8.8 g, 0.67 mol in 1000 mL of dry ethyl acetate was 
added. After stirring at 5°C for 2 hours, the solid was 
filtered and washed with ethyl acetate (3x50 mL) . The 
solvent was removed in vacuo and the residue was purified 
by crystallization from hexane and ethyl acetate to give 
compound 5, a solid, 83.2 g, 63%. [a] D+6 . 0 (cO. 25 , MeOH) ; 
Ih-NMR (300 MHz, CDCI3) , 5=7.30-7.80 (m, 8 H, Ar-H) , 5.42 
(t, 1 H, J=5.5 Hz, NH) , 4.43 (d, 2 H, J=7 . 0 Hz, CH^ on Fmoc) , 
4.37(d, 2 H, J=6.0 Hz, Ala- (a-H) , 4.23(t, 1 H, J=7 . 0 Hz, GH 
on Fmoc), 2.84 (s, 4 H, 2 CH2) . 

Preparation of compound 7 

To a solution of L-serine, 20 g, 0.18 mol and sodium 
bicarbonate, 16 g in 150 mL of water was dropped compound 
5, 5 6 g, 0. 142 mol in 600 ml of DME . The mixture was 
stirred at room temperature overnight. DME was removed in 
vacuo and the 5% of aqueous HCI was added to adjust pH=2 . 
The water solution was extracted with ethyl acetate (3x200 
mL) . The organic layer was washed with water and dried 
over Na2S04. The solvent was removed in vacuo and the 
residue was purified by silica gel column using ethyl 
acetate/acetic acid (9:1) to give compound 7, a white 
powder after freeze drying from dioxane, 3 5 g, 62%. 
[q;]D+9.6 (cO.50, MeOH); 'H-NMR (3 00 MHz, CDCI3+CD3OD) , 
6=7.30-7.80 (m, 8 H, Ar-H), 4.61 (t, 1 H, J=4 . 0 Hz, ser- 
Q,_H) , 4.45 (d, 2 H, J=7.0 Hz, CH2 on Fmoc), 4.30 (t, 1 H, 
J=7.0 Hz, CH on Fmoc), 4.03 (dd, 1 H, J=4 . 0 , 11.5 Hz, ser- 



a-H) , 3.90-4.08 (m, 3H) . ES-MS: Calc. for C20H20O7N2, 384.3 ; 
Found, 3 84.3. 

Preparation of compound 8 

5 To a solution of compound 1. 1.7 g, 4.25 mmol in 2 0 mL of 
dry THF was added 0.44 niL of N-methylmopholine and 0.52 mL 
of i-butylchloroformate at -20°C under nitrogen. After 
stirring at -20°C for 5 min., a solution of compound 3, 
1.12 g, 4 mmol and 0.44 mL of N-methylmorpholine in 10 mL 

10 of dry THF and 5 mL of dry DMF was dropped in 5 min. After 
stirring at -20°C for 30 min., 5 mL of methanol was added 
and the solvent was removed in vacuo. The residue was 
purified by silica gel column using hexane/ethyl 
acetate/methanol (10:10:1) to give compound 8, a white 

15 powder after freeze drying from dioxane, 1.7 g, 66%. 

[o;]D-36.5 (cO.2, MeOH : CHCl3=4 : 1 ) ; IH-NMR (300 MHz, 
CDCI3+CD3OD) , 6=7.30-7.80 (m, 13 H, Ar-H) , 4.99, 5.05 (2d, 
2 H, J=12.5 Hz, CHjPh) , 4.34-4.45 (m, 2 H) , 4.24-4.32 (m, 3 
H) , 4.08-4.20 (m, 2 H) , 3.78-3.85 (m, 2 H) , 3.75 (d, 2 H, 

20 7.0 Hz, CH2) , 3.53-3.62 (m, 1 H) , 1.28 (d, 3 H, J=7 . 0 Hz, 
CH3) , 1.06 (d, 3 H, J=6.0 Hz, CH3) . ES-MS; Calc. for 
C44H37O9N4; 645.67. Found; 646.0 (M+H) , 669.3 (M+Na) . 

Example 2 : Synthesis of donors for first level library 

25 Synthesis of donors for the first level of the library 

is summarized in Reaction Scheme II. 

Preparation of compound 9 

A mixture of N-acetyl -D-galactosamine , 20 g, 90.4 mmol, 20 
3 0 mL of benzaldehyde dimethyl acetal and 200 mg of 
p-toluenesulf onic acid in 500 mL of dry acetonitrile was 
stirred at 60°C for 5 hours. After cooling to room 
temperature, the solid was filtered, washed with CH2CI2 
(3x20 mL) and dried in vacuo to give compound 9, a solid, 
35 22 g, 88%. [o;] D +133.0 (cl.O, HjO). =H-NMR (DMS0-d6) : 

6=7.35-7.71 (m, Ar-H), 5.62 (s, 1 H, CHPH) , 5.12 (d, 1 H, 
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J=3.0 Hz, H-1) , 4.21 (bd, 1 H, J=3 . 5 Hz, H-4), 4.12 (dd, 1 
H, J=3.5, I 1.0 Hz, H-2) , 4.06 (m, 2 H, H-6), 3.92 (m, 

1 H, H-3) , 3.85 (m, I H, H-5), 1.90 (s, 3 H, NAc) . '^C-NMR: 
6=99.7 (CHPh) , 91.4 (C-1). 

Preparation of compound 10 

To a solution of compound 9, 20 g, 64.7 mmol in 150 mL of 
dry pyridine was dropped 8.9 mL of benzoyl chloride in 4 0 
mL of CH2C12 at -25^C under argon in 30 min. and stirred for 

2 hours. After adding 5 mL of ethanol the solvent was 
removed in vacuo. Aqueous workup (CHjClj) and 
recrystallization from ethyl acetate gave compound 10, a 
solid, 16 g, 62%.[a;]D +182.0 (cl, MeOH) ; 'H-NMR: 87.30-8.10 
(m, Ar-H) , 6.05 (s, 1 H, NH) , 4.00-5.40 (m) , 1.88 (s, 3 H, 
NAc) . 

Preparation of compound 11 

To a solution of compound 10, 10 g, 24.9 mmol in 100 mL of 
dry CH2CI2 was added 7 mL of trichloroacetonitrile and 0.5 
mL of DBU. The solution was stirred for 2 hours at room 
temperature and solvent was removed in vacuo. The residue 
was purified by silica gel column using hexane/ethyl 
acetate (1:1) to give compound 11, a white powder after 
freeze drying from benzene, 11.5 g, 85%. [a] D +151.0 
(cl.O, CHCI3) . IE NMR: 5 = 8.85 {s, 1 H, NH) , 7.408.10 (m, 
Ar-H), 6.75 {d, 1 H, J=3 . 5 Hz, H-1), 5.65 (d, 1 H, J=9 . 0 
Hz, NH) , 5.60 (s, 1 H, CHPH). 5.50 (dd, 1 H, J=3.5 Hz, H- 
3), 4.00-5.15 (in), 1.90 (s, 3H, NAc). 

Preparation of compound 12 

To a solution of N-acetyl galactosamine , 50 g, 0.226 mol in 
100 ml of dry allyl alcohol was dropped 10 N HCl in 50 ml 
of THF. After stirring at 50 °C overnight the solvent was 
removed in vacuo and the residue was purified by 
recrystallization from ethanol to give compound 12, a 
solid, 35 g, 59% - [a]D +165.00 (cl.O, H2O). ^H NMR (D20): 



6 = 6.00 (m, 1 H, CH=) , 5.30 (m, 2 H, =CR^) , 4.95 (d, 1 H, 
J=3.5 Hz, H-1) , 3.70-4.30 (m) , 2.01 (s, 3 H, NAc). 

Preparation of compound 13 

5 A mixture of compound 12, 35 g, 0. 134 mol, 62 mL of 
benzaldehyde dimethyl acetal and 350 mg of 
p-toluenesulfonic acid in 600 mL of dry acetonitrile was 
stirred at 60 °C for 4 hours. The solvent was removed in 
vacuo and the residue was purified by recrystallization 
10 from ethanol to give compound 13, a solid, 2 9 g, 62%. [a] D 
+155.0 (cl.O, MeOH) . ^HNMR: 8=7.30-7.60 (m, Ar-H) , 5.80 (m, 
1 H, CH-) , 5.57 (s, 1 H, CHPh) , 5.26 (m, 2 H =CH2) , 5.00 (d, 
1 H, J=3.5 Hz, H-1), 3.70-4.55 (m) , 2.90 (d, 1 H, J=10.0 
Hz, OH) , 2 . 04 (s, 3 H, NAc) . 

15 

Preparation of compound 14 

A solution of 13, 20 g, 57.3 mmol, 40 g of tetra- 0 -acetyl - 
bromo- (a-D-galactopyranoside and Hg(CN)2, 24 g in 50 mL of 
dry benzene/ 50 mL of dry nitromethane were stirred at 50 °C 

20 overnight under argon. Tetra-O-acetyl-bromo- (a-D- 

galactopyranoside (25 g) and Hg(CN)2 (15 g) were added and 
the stirring was continued for 4 hours. The solvent was 
removed in vacuo and the residue was dissolved in 10 0 0 mL 
of CH2C12, washed with sat ' d Na2C03, 30% KBr and water before 

2 5 drying over Na2S04. The solvent was removed in vacuo and 
the residue was purified by silica gel column (1:1 to 1:4 
hexane/ethyl acetate) to give compound 14, a white powder 
after freeze drying from dioxane, 25 g, 64%. [a] D +96.00 
(c2.0, MeOH). 'H-NMR (300 MHz, CDCI3) : 8=7.40-7.70 (m, Ar- 

30 H) , 5.88 (m, 1 H, CH=) , 5.65 (d, 1 H, J=9 . 5 Hz, NH) , 5.55 
(s, 1 H, CHPH), 5-35 (m, 1 H, H-4b) , 5.25 (m, 3 H) , 5.03 
(d, I H, J=3.5 Hz, H-la) , 4.95 (dd, 1 H, J=3.5, 10.0 Hz, H- 
3b), 4.75 (d, 1 H, J=7.5 Hz, H-lb), 3.6-4.70 (m) , 2.14, 
2.04, 1.98, 1.96 (4 s, 15 H, 5 Ac). 

35 
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Preparation of compound 15 

To a solution of compound 14, 2 3 g, 3 3 mmol and 
[bis (methyldiphenylpilosphine) 1(1, 5 -cyclooctadiene) - 
iridium(l) hexaf luorophosphate , 500 mg in 500 mL of dry THF 
5 was bubbled argon for 15 min. and hydrogen for 5 min. The 
solution was stirred at room temperature for 4 hours. 15 g 
of I-,, 8 00 mg of dimethylaminopyridine , 10 mL of pyridine 
and 80 mL of water were added and the solution was stirred 
at room temperature overnight. Sodium sulfate (5%, 200 mL) 

10 was added and THF was removed in vacuo. Aqueous workup 
(CH2CI2) and silica gel purification (5:10:2 hexane/ethyl 
acetate/methanol) gave compound 15, a white powder after 
freeze drying from dioxane, 12.5 g, 58%. [q;]D+55.6° (cl.O, 
MeOH) . ^H-NMR(300 MHz, CDCI3) : 5 = 7.35-7.60 (m, Ar-H) , 6.50 

15 (d, 1 H, J=9.0 Hz, NH) , 5.45 (s, 1 H, CHPh) , 5.33 (d, 1 H, 

J=3.5 Hz, H-la) , 5.15 (m, 2 H, H-2b & H-4b) , 4.96 (2d, 1 H, 
J=3.5, 10.5 Hz, H3b) , 4.68 (d, 1 H, J=8 . 0 Hz, H-lb), 3.60- 
4.50 (m) , 2.11, 2.02, 2.00, 1.96, 1.94 (5 s, 15 H, 5 Ac). 

2 0 Preparation of compound 16 

To a solution of compound 15, 7.5 g, 11.5 mmol in 100 mL of 
dry CH2CI2 was added 2.6 mL of trichloroacetonitrile and 
0.25 mL of DBU- The solution was stirred for 2 hours at 
room temperature and solvent was removed in vacuo. The 
25 residue was purified by silica gel column using 
hexane/ethyl acetate (1:2) to give compound 16, a white 
powder after freeze drying from benzene, 5.6 g, 61%. [a] D 
+ 81.2 (cl.O, CH2CI2) . ^H-NMR (300 MHz, CDCI3) : 6 = 8.75 (s, 1 
H, NH) , 7.35-7.60 (m, Ar-H), 6.62 (d, 1 H, J=3 . 5 Hz, H-la), 
30 5.80 ( d, I H, J=8.0 Hz, NH) , 5.52 (s, 1 H, CHPh), 5.40 
(2d, 1 H, J=1.0, 3.5 Hz, H-4b) , 5.25 (dd, 1 H, J=8.0, 10.0 
Hz, H-2b) , 5.01 (dd, 1 H, J=3.5, 10.5 Hz, H-2a) , 4.90 (d, 
1 H, J=8.0 Hz, H-lb), 3.80-4.80 (m) , 2.15, 2.04, 2.02, 
2.00, 1 . 96 (5 s, 15 H, 5 Ac) . 
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Example 3: Synthesis of donors for second level library 

Synthesis of donors for the second level of the 
library is summarized in Reaction Scheme III. 



5 Preparation of compound 17 

K. Fufase, et al . , Tetrahedron Lett., 36: 7455-7458. 
To a solution of D-glucosamine hydrochloride, 34 g, in 1 L 
of water was added trichloroethyl chlorof ormate , 3 3 mL 
dropwise over 2-3 hours at 0°C. After stirring at room 

10 temperature overnight, the solid was filtered out and 
washed with water and then ether. The solid was 

recrystallized from ethanol to give N- troc-D-glucosamine , 
59.5 g, 98%. A solution of N-troc-D-glucosamine, 13.5 g, 
3 8.2 mmol in 100 mL of allyl alcohol with 5% HCl was 

15 stirred at 100 °C for 30 min . After cooling to room 
temperature, allyl alcohol was remove in vacuo and the 
residue was dissolved in 250 mL of acetonitrile . 12 ml of 
PhCH(0]y[e)2 and 100 mg of TSOH were added. The mixture was 
stirred at room temperature over night under argon. 10 g of 

2 0 NaiCOj was added and the mixture was stirred for 10 min. 

Solid was removed and washed with acetone (5x5mL) . The 
solvent was removed and the solid was purified by 
crystallized from ethanol to give compound 17, a solid, 12 
g, 65%. 'H-NMR (300 MHz, CDCI3) . 6 = 7.30-7.60 (m, 5 H, Ar- 
25 H) , 5.80-6.00 (m, 1 H, CH=) , 5.55{ s, 1 H, CHPh) , 5.20-5.40 
(m, 2 H, =CH2) , 4.92 (d, 1 H, J=3 . 0 Hz, H-1), 4. 82, 4.69 (2 
d, 2 H, J=12.0 Hz, CH2) , 3.70-4.30 (m) , 3.59 (t, 1 H, J=8 . 0 
Hz, H-4) , 2.67 (d, IH, J=2 . 0 Hz, OH). 

3 0 Preparation of compound 18 

A solution of compound 17, 8 g, 16.6 mmol in 20 mL of 
pyridine and 10 mL of acetic anhydride was stirred at room 
temperature overnight . The solvent was removed and the 
residue was purified by crystallization from ethanol to 
35 give compound 18, a solid, 4.7 g, 54%. [a] D +58.5 (cl, 
ethyl acetate); 'H-NMR (300 MHz, CDCI3) : 6 = 7.30-7.50 (m, 5 
H, Ar-H) , 5.806.00 (m, 1 H, CH=) , 5.53 (s, 1 H, CHPH), 5.38 



(t, 1 H, J=10 Hz, H-3), 5.20-5.35 (m, 2 H, ^CH^) , 4.93 (d, 
1 H, J-3.5 Hz, H-1), 4.79, 4.68 (2 d, 2 H, J=12 . 0 Hz, CH2) , 
3.65-4.35 (m) , 2. 1 0 (s, 3 H, CH3) . 



5 Preparation of compound 19 

To a solution of compound 18, 9.1 g, 17.4 mmol in 3 00 mL of 
THF was added [bis (methyldiphenylphosphine) ] ( 1 , 5 - 
cyclooctadiene) -iridium (1) hexaf luorophosphate , 350 mg . 
Argon was passed to the solution for 10 min. following by 
10 hydrogen, 2 0 min. The solution was stopped and stirred at 
room temperature for 4 hours. 500 mg of DMAP, 6.25 mL of 
pyridine, 50 mL of water and 9.3 5 g of I2 were added. The 
mixture was stirred at room temperature overnight. THF was 
removed and the mixture was dissolved in 1 L of CHjClj. The 
15 solution was washed with sat. sodium carbonate, IN HCI and 
water, dried over Na2S04. The solvent was removed and the 
residue was purified by silica gel column using 
hexane/ethylacetate=7 : 3 to give compound 19, a white powder 
after freeze drying from dioxane, 5.5 g, 65%. [a] D +60.0 
20 (cl, ethyl acetate); 'H-NMR (300 MHz, CDCI3) : 6=7.30-7.50 

(m, 5 H, Ar-H) , 5.53 (s, IH, CHPh) , 5.52 (d, 1 H, J=10 . 0 
Hz, NH) , 5.43 (t, 1 H, J=10 Hz, H-3), 5.33(d, IH, J=3.5 Hz, 
H-1), 4.81, 4.66 (2 d, 2 H, J=12 . 0 Hz, CH2) , 3.65-4.35 (m) , 
3.17 (dd, IH, J=1.0, 3.5 Hz, OH), 2.06 (s, 3 H, CHj) . 

25 

Preparation of compound 2 0 

A solution of compound 19, 5.5 g, 11 mmol, 2.42 mL of 
tricholoroacetonitrile and 6 drops of DBU in 70 mL of CH2CI2 
was stirred at room temperature for 2 hours under argon. 

3 0 The solvent was removed in vacuo and the residue was 
purified by silica gel column using hexane/ethyl 
acetate (7:3) to give compound 20, a white powder after 
freeze drying from benzene, 5.59 g, 81%. [a] D +59.0 (cl, 
ethyl acetate); 'H-NMR (300 MHz, CDCI3) : 8=8.80 (s, 1 H, 

35 NH) , 7.307.50 (m, 5 H, Ar-H), 6.40 (d, 1 H, J=3.5 Hz, H-1), 
5.56 (s, IH, CHPh), 5.46 (t, 1 H, J=10 Hz, H-3), 5.36 (d, 
1 H, J=9.0 Hz, NH) , 4.71 (dd, 2 H, J=12 . 0 Hz, CHj) , 4.35 



- 21 - 



(dd, 1 H, J=5.0, 10.0 Hz, H-4) , 4.27 (m, 1 H, H-2), 3.75- 
4.00 (m) , 2.11 (s, 3 H, CH3) . 

Preparation of compound 21 

5 G. Grundler and R. R. Schmidt, Libigs Ann. Chem. , 1984, 
1826-1847 . 

To a solution of 2.0 mL of HC104 (70%) in 300 mL of acetic 
anhydride was added lactose, 100 g, 0.29 mol by portion to 
keep the temperature between 3 0 to 35°C. After adding of 

10 15 g of red phosphorous, the mixture was cooled in ice- salt 
bath and 90 g (29 mL) of Br2 was dropped to keep the 
temperature below 20°C. 15 mL of water was added in 15 min. 
The solid was filtered out and washed with acetic acid. 
The mixture was added to a solution of 100 g of zinc and 11 

15 g of CUSO4 in 290 mL of water and 200 mL of acetic acid in 
2 hour at -20°C. The mixture was stirred at -20°C for 2 
hours and the solid was filtered out and washed with acetic 
acid. The solvent was removed and the residue was purified 
by silica gel column using hexane/ethyl acetate (1:1) to 

2 0 give compound 21, a white powder after freeze drying from 

benzene, 100 g, 62%. ^H-NMR (300 MHz, CDCI3) : 8 = 6.32(dd, 1 
H, J=2.0, 6.5 Hz, =CH) , 5.32 (m, 1 H, -CH) , 5.27 (dd, I H, 
J= 1.0, 3.5 Hz, H-4'), 5.09 (dd, 1 H, J=8.0, 11.0 Hz, 
H-2'), 4.92 (dd, 1 H, J=3 . 5 , 10.5 Hz, H-3 ' ) , 4.74 (dd, 1 H, 
25 J=3.5, 5.0 Hz, H-3), 4.60 (d, I H, J=8.0 Hz, HI'), 
3.80-4.40 (m) , 2.07, 2.03, 2.00, 1.97, 1.96, 1.89 (6 s, 18 
H, 6 Ac) . 

Preparation of compound 22 

30 G. Grundler and R. R. Schmidt, Libigs Ann. Chem., 1984, 
1826-1847 . 

To a solution of compound 21, 100 g, 0.179 mol and 300 g of 
eerie ammonium nitrite (CAN) was added 20 g of sodium azide 
at -20°C under nitrogen. After stirring at -20°C for 6 

3 5 hours, 1 L of water was added and the mixture was extracted 

with ether (5x500 mL) . Ether was washed with water and 
evaporated. The residue was dissolved in 1 L of dioxane 
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and 10 0 g of NajNOj in 3 0 mL of water was added, the mixture 
was stirred at 80°C for 6 hours. 500 mL of water was added 
and the mixture was extracted with CH2CI2 (5x400 mL) . The 
solution was washed with water and dried over Na2S04. The 
solvent was removed in vacuo and the residue was purified 
by silica gel column using hexane/ethyl acetate (1:1 to 
1:2) as eluant to give compound 22, a whiter powder after 
freeze drying from benzene, as a mixture of a, /?- isomers. 

Preparation of compound 23 

G. Grundler and R. R. Schmidt, Libigs Ann. Chem. , 1984, 
1826-1847 . 

To a solution of compound 22, 2 0 g, 32.4 mmol in 3 00 mL of 
CH2CI2 was added 8 mL of CI3CCN and 1.7 mL of DBU. The 
mixture was stirred at room temperature for 3 hours under 
argon. The solvent was removed in vacuo and the residue 
was purified by silica gel column using hexane/ethyl 
acetate from 1:1 to 2:3 to give compound 23, a white powder 
after freeze drying from benzene, 11.5 g, 47%. ^H-NMR (300 
MHz, CDCI3) : 5 = 8.80 (s, 1 H, NH) , 6.44 (d, 1 H, J=3 . 5 Hz, 
H-1) , 5.51 (dd, 1 H, J=9.0, 10.5 Hz, H-3), 5.36 (dd, 1 H, 
J=1.0, 3.0 Hz, H-4'), 5.13 (dd, 1 H, J=8 . 0 , 10.0 Hz, H-2 ' ) , 
4.96 (dd, 1 H, J=3.5, 10.5 Hz, H-3 ' ) , 4.51 (d, I H, J=7 . 5 
Hz, H-1'), 3.70-4.50 (m) , 3.62 (dd, 1 H, J=3 . 5 , 10.5 Hz, 
H-2), 1.90-2.20(6 s, 18 H, 6 Ac). 

Preparation of compound 24 

T. J. Martin and R. R. Schmidt, Tetrahedron Lett., 
36: 7455-7458. 

A solution of sialic acid, 10 g and IR120+ resin, 4 g in 
1000 mL of dry methanol was stirred at room temperature 
overnight under argon. The resin was filtered out and 
washed with methanol . The solvent was removed to give 
compound 24, a solid, 10.65 g, 100%. 



Preparation of compound 2 5 

T. J. Martin and R. R. Schmidt, Tetrahedron Lett., 
36: 7455-7458. 

To a solution of 20 mL of acetic anhydride was added 0 . 1 mL 
5 of HCIO4. Compound 24, 10.65 g, was added to the mixture by 
portion to keep the temperature between 3 0 to 35 °C. After 
finishing adding the mixture was stirred at room 
temperature for 3 hours. The solution was poured onto ice 
water and kept at room temperature overnight. The solution 

10 was extracted with chloroform (3x100 mL) and washed with 
sat. NaHCOs and water, dried over Na2S04 . The solvent was 
removed in vacuo and the residue was purified by silica gel 
column using hexane/ethyl acetate/methanol (5:20:1) to give 
compound 25, a white powder after freeze drying from 

15 dioxane, 6 g, 76% as a mixture of a;,p-isomers ^H-NMR of 
major isomer (300 MHz, CDCI3) : 6 = 6.15 (d, 1 H, J=10.0 Hz, 
NH) , 5.39 (dd, 1 H, J=2.0, 4.5 Hz, H-7) , 5.23 (m, 1 H, H- 
8), 5.16 (t, 1 H, J=9.0 Hz, H-4) , 4.97 (s, 1 H, OH), 4.60 
(dd, 1 H, J=2.0, 10.0 Hz, H-9a) , 4.18 (t, 1 H, J=10 . 0 Hz, 

20 H-5) , 4.03 (dd, 1 H, J=8 . 0 , 11.5 Hz, H-9b) , 3.85 (s, 3 H, 
CH3) , 2.22 (d, 1 H, J=7.0 Hz, H-3e) , 2.15, 2.07, 2.02, 2.01, 
1.90, (5 s, 15 H, 5 Ac). 2.02 (m, 1 H, H-3a) . 

Preparation of compound 2 6 

25 T J. Martin and R. R. Schmidt, Tetrahedron Lett., 
36: 7455-7458. 

To a solution of compound 25, 7.2 g, 14.4 mmol in 150 mL of 
dry acetonitrile was added 6 mL of EtNi-Prj and 4.5 mL of 
phospholoride diethyl ester. The solution was stirred at 
3 0 room temperature for 10 min. and the solvent was removed in 
vacuo. The residue was purified by silica gel column using 
hexane/acetone 1 : 1 to give compound 26, a syrup, 8.5 g, 97% 
as a mixture of (o; , |3 - isomers . 'H-NMR (3 00 MHz, CDCI3) : 
5=2.80 (dd, 0.33 H, H-2e/3) . 2.50 (dd, 0.67H, H-2e a). 
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Rxample 4 : Generation of first level library 

GSTA is glycosylated separately with the protected Tn 
and TF donors to produce first level libraries with four 
glycoforms each: 

1. Tn: 00, TnO , OTn, TnTn 

2. TF: 00, TFO, OTF, TFTF 
where "0" means no glycosylat ion . 

GSTA also is reacted with Tn and TF as a mixture. The 
result is a first level library that contains nine 
glycoforms : 

3. Tn + TF: 00, TnO, OTn, TnTn, TFO, OTF, TFTF, 

TnTF , TFTn 

As can be seen, the first level libraries 1, 2 and 3 
above are different in composition and/or size. Library 3 
formed with mixed Tn and TF glycosyl donors contains all 
the glycoforms that occur in libraries 1 and 2, formed when 
Tn and TF are used separately as glycosyl donors, in 
addition to other components that arise from the use of 
mixed donors. A library formed with a mixture of two 
donors is therefore more comprehensive than a library 
formed by combining two libraries produced when the two 
donors are reacted with the core peptide separately. 
Generation of a first level library is summarized in 
Reaction Scheme IV. 

Preparation of compounds 27 

A solution of compound 8, 0.6 g, 0.925 mmol , compound 11, 
2 g, 3.7 3 mmol, compound 16, 2.45 g, 3.088 mmol and 3.5 g 
of 3 A molecular sieves in 2 0 mL of dry THF was stirred at 
room temperature for 10 min. under argon and cooled to 
-20 °C. Ten mL of 0.1 mol of BF3Et02 in dry THF was added in 
5 min. After stirring at -20 °C for 1 hour, the reaction 
mixture was then warmed to room temperature. The molecular 
sieves were filtered out and washed with ethyl acetate 
(3x20 mL) . The solvent was removed in vacuo and the 
residue was purified by silica gel column using 
hexane/ethyl acetate/methanol (10:10:1.5) to give 2.3 g of 



a mixture as a white powder after freeze drying from 
dioxane . 

Preparation of compounds 2 8 

A solution of compound 27, 2.3 g in 40 mL of dry pyridine 
and 5 mL of dry acetic anhydride was stirred at room 
temperature overnight under argon. The solvent was removed 
in vacuo and the residue was purified by silica gel column 
using hexane/ethyl acetate/methanol (10:10:1.5) to give 
2.34 g of a white powder after freeze drying from dioxane. 

Preparation of compounds 2 9 

A solution of compound 28, 1.9 g in 30 mL of 80% aqueous 
acetic acid was stirred at 80°C for 2 hours. The solvent 
was removed in vacuo and the residue was purified by silica 
gel column using hexane/ethyl acetate/methanol (5:10:3) to 
give 1.61 g of a white powder after freeze drying from 
dioxane . 

Preparation of compound 3 0 

A mixture of compound 29, 150 mg and 5 mL of 0 . 1 N NaOH in 
5 mL of methanol was stirred at room temperature for 24 
hours . Then 5 mL of 0 . 1 N NaOH and 5 mL of methanol was 
added and the stirring was continued for 24 hours. IR120+ 
resin was added to adjust pH=4 . 5 . The resin was filtered 
out and washed with water (5x5 mL) . The water was removed 
in vacuo and the residue was purified by P-2 column using 
water as eluant to give a solid, 13.1 mg . 



ES-MS 



30-1 



30-2 



30-3 



30-5 



30-4 



30-6 



Calc 



Calc 



Calc 



Calc 



Calc 



Calc 



for C2oH3gO jjNj , 53 7.50. Found , 

for C2gH4g0i7N6, 74 0.31. Found, 

for C25H450i7N5, 6 9 9.64. Found, 

for C4oH6g027N6, 1064.41. Found, 

for C34H58O22N6, 9 02.36. Found, 

for C34H5g022N6, 902.36. Found, 



538.55 (M+H) 
741.41 (M+H) 
700.43 (M+H) 
1065.67 (M+H) 
903 . 51 (M+H) 
903.51 (M+H) 



Example 5 : Generation of second level libraries 

Unglycosylated serine and threonine residues in the 
first level libraries are temporarily blocked by 
acetylation to prevent the next set of glycosyl donors from 
reacting with unreacted sites. Following purification the 
4 ^ 6_benzylidenyl protecting groups of all glycoforms are 
selectively removed by acid cleavage to create additional 
glycosylation sites on all the existing carbohydrate 
structures. GlcNAc, sialic acid (S) and 

N-acetyllactosamine (L) are reacted with the existing 
carbohydrate structures of the glycopeptides of each first 
level library, to produce second level libraries with new 
structures. Sialic acid and N-acetyllactosamine react at 
the 6-0- position of N-acetylgalactosamine common to the 
glycosylated Tn and TF components of the first level 
libraries . 

For example, the following second level libraries are 
created when libraries 1, 2 and 3 of Example 1 are reacted 
with sialic acid: 

4. 1 + S: 00, TnO, STnO, OTn, OSTn, TnTn, TnSTn, 

STnTn, STnSTn 

5. 2 + S: 00, TFO, STFO , OTF, OSTF, TFTF, TFSTF , 

STFTF, STFSTF 

6. 3 + S: 00, TnO, STnO, OTn, OSTn, TnTn, STnTn, 

TnSTn, STnSTn, TFO, STFO, OTF, OSTF, 
TFTF, STFTF, TFSTF, STFSTF, TnTF , 
STnTF, TnSTF, STnSTF, TFTn, STFTn, 
TFSTn, STFSTn 

Second level library 6 is more comprehensive than 4 and 5 
combined . 

The number of components may be increased further by 
using mixed donors, as in library 7. When library 3 from 
the first level is reacted with a mixture of sialic acid 
and N-acetyllactosamine, the following library is produced: 

7. 3 + SL: 00, TnO, STnO, OTn, OSTn, TnTn, STnTn, 

TnSTn, STnSTn, TFO, STFO, OTF, OSTF, 
TFTF, STFTF, TFSTF, STFSTF, TnTF, 
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STnTF 



TnSTF, 



STnSTF, 



TFTn, 



STFTn, 



TFSTn 



STFSTn, 



LTnO , 



OLTn, 



LTnTn, 



LTFTF 



TnLTn 



LTnLTn, LTFO , OLTF, TFLTF, 
LTFLTF, STnLTn, LTnSTn, LTFSTF, 



STFLTF 



Additional second level libraries can be formed by mixing 
each of libraries 1, 2 and 3 with N-acetyllactosamine , and 
by mixing each of libraries 1 and 2 with a mixture of 
sialic acid and N-acetyllactosamine. 

The ultimate size and definition of the library is 
controlled by the number and identity of donors, pre- 
blocking of defined sites on the peptide, and use of split- 
mix-split type of synthesis, as described in Plunkett et 
al., Scientific American, April 1997, 68-73, the contents 
of which are incorporated herein by reference. Generation 
of a second level libraries is summarized in Reaction 
Schemes V, VI, VII and VII. 

Synthesis of second level library with GlcNAc as donor 
Preparation of compound 31 

A solution of compounds 29, 0.4 g, compound 20, 1.2 g and 
2 g of 3 A molecular sieves in 15 mL of dry acetonitrile 
was stirred at room temperature for 10 min. under argon. 
The mixture was cooled to -20°C and 3 mL of 0 . 1 mol of 
TMS-OTF in dry acetonitrile was added in 5 min. The 
solution was stirred at -20°C for 1 hour and warmed to room 
temperature. The molecular sieves were filtered out and 
washed with ethyl acetate (5x5 mL) . The solvent was 
removed in vacuo and the residue was purified by silica gel 
column using hexane/ethyl acetate/methanol (5:10:2) to give 
0.64 g of a white powder after freeze drying from dioxane . 

Preparation of compound 32 

A solution of compound 31, 0 . 63 g and 4 g of activated zinc 
in 50 mL of 80% acetic acid in ethyl acetate was stirred at 
room temperature for 2 hours. Zinc was filtered out and 
washed with ethyl acetate (5x20 mL) . The solvent was 



removed in vacuo and the residue was dissolved in 15 mL of 
dry pyridine and 5 mL of dry acetic anhydride. The 
solution was stirred at room temperature for 12 hours. The 
solvent was removed in vacuo and the residue was purified 
by silica gel column using hexane/ethyl acetate/methanol 

(5:10:2) to give 0.60 g of a white powder after freeze 
drying from dioxane . A solution of above solid, 200 mg, in 
200 mL of 80% of aqueous acetic acid was stirred at 80°C 
for 2 hours and the solvent was removed in vacuo. The 
residue was dissolved in 5 mL of methanol and 20 mL of 0.1 
N NAOH was added and the mixture was stirred at room 
temperature for 24 hours. IR120+ resin was added to adjust 
pH=4.5 and the resin was filtered out and washed with water 

(5x5 mL) . Water was removed in vacuo and the residue was 
purified by P-2 column to give a solid, 15 mg . 



ES- 


MS 


















32- 


1 




Calc. 


for C2gH4g0nN6 


, 740 . 


31 . 


Found , 


741 . 


30 


(M+H) + 


32 


-2 




Calc. 


for C36H61O22N7 


, 943 . 


39 . 


Found , 


944 . 


46 


(M+H) + 


32 


-3 




Calc. 


for Cj.H^iO^^N, 


, 943 


39. 


Found, 


944 . 


46 


(M+H) + 


32 


-4 




Calc. 


for C44H74O 


27N8, 


1146 


.46 . 


Found , 


574 . 50 








(M+2H) 2+ 














32 


-5 




Calc. 


for C34H58O22N6 


, 902 


36 . 


Found , 


903 


41 


(M+H) + 


32 


-6 




Calc. 


for C42H71O27N7 


1105 


44 . 


Found, 


1106 


.72 


(M+H) + 


32 


-7 




Calc. 


for C42H71O27N7 


1105 


.44 . 


Found , 


1106 


. 72 


(M+H) + 


32 


-8 




Calc. 


for C56H94O37N8 


1470 


.57 . 


Found , 


737 . 


00 


(M+2H) + 


32 


-9 




Calc. 


for C5oHg4 0 32N8 


1308 


. 52 . 


Found , 


1309 


. 57 


(M+H) + 


32 


-10 


Calc. 


for C50H84O32N8 


1308 


. 52 . 


Found , 


1309 


.57 


(M+H) + 


32 


-11 


Calc. 


for C42H71O27N7 


1105 


.44 . 


Found , 


1106 


.72 


(M+H) + 


32 


-12 


Calc. 


for C42H71O27N7 


1105 


.44 . 


Found , 


1106 


. 72 


(M+H) + 


32 


-13 


Calc . 


for C48H8,032N7 


1268 


. 16 . 


Found, 


635 . 


0 (M+2H)2+ 


32 


-14 


Calc . 


for C48Hgi032N7 


1268 


. 16 . 


Found, 


635 . 


0 (M+2H)2+ 



Synthesis of second level library with N- acetyl lactosamine 
as donor 

Preparation of compound 3 3 

A solution of compound 29, 0.4 g, compound 23, 1.5 g and 2 
g of 3A molecular sieves in 15 mL of dry CHjClj was stirred 
at room temperature for 10 min. under argon. Three mL of 
0.1 mol of BFjOEtj in dry CHjClj was added in 10 min. The 
solution was stirred at room temperature for 1 hour. The 
molecular sieves were filtered out and washed with ethyl 
acetate (5xlO mL) and the solvent was removed in vacuo. 
The residue was purified by silica gel column using 
hexane/ethyl acetate/methanol (5:10:2) to give 0.43 g of a 
white powder after freeze drying from dioxane. 

Preparation of compound 34 

A solution of compound 33, 0.42 g in 30 mL of pyridine and 
2 mL of water was passed HjS gas for 3 0 min. The solution 
was stopped and stirred at room temperature for 4 8 hours. 
The solvent was removed in -vacuo and the residue was 
dissolved in 15 mL of dry pyridine and 5 mL of dry acetic 
anhydride. The mixture was stirred at room temperature for 
12 hours . The solvent was removed in vacuo and the residue 
was purified by silica gel column using hexane/ethyl 
acetate/methanol (5:10:2) to give 0.44 g of a white powder 
after freeze drying from dioxane. A solution of the above 
solid, 200 mg, was dissolved in 5 mL of methanol and 2 0 mL 
of 0.1 N NaOH and the mixture was stirred at room 
temperature for 24 hours. IR12 0+ resin was added to adjust 
pH=4 . 5 and the resin was filtered out and washed with water 
(5x5 mL) . Water was removed in vacuo and the residue was 
purified by P-2 column to give a solid, 12.7 mg . 



ES-MS 



34-3 



34-2 



34-1 



34-4 



Calc 



Calc 



Calc 



Calc 



for C34H58O22N6, 902.36. Found, 903.46 (M+H) + 

for C42H71O27N7, 1105.44. Found, 1106.54 (M+H) + 

for C42H7,027N7, 1105.44. Found, 1106.54 (M+H) + 

for C56H94037Ng, 1470.57. Found, 736.60 (M+2H) + 



34-5: Calc. for C40H68O27N6, 1064.41. Found, 1065.54 (M+H) + 

34-6: Calc. for C4gHg,032N7, 1267.49. Found, 1268.52 (M+H) + 

34-7: Calc. for C4gHgi032N7, 1267.49. Found, 1268.52 (M+H) + 

34-8: Calc. for C68Hu4047Ng, 1794.68. Found, 898.70 

(M+2H) 2+ 

34-9: Calc. for C62Hio4 0 42Ng , 1632.62. Found, 817.64 

(M+2H) + 

34-10: Calc. for C62Hio4042Ng, 1632.62. Found, 817.64 

(M+2H) + 

34-11: Calc. for C4gHgi032N7, 1267.49. Found, 1268.52 (M+H) + 

34-12: Calc. for C48H81O32N7, 1267.49. Found, 1268.52 

(M+H) + 

34-13: Calc. for C54H9,037N7, 1430.30. Found, 1430.71 

(M+H) + 

34-14: Calc. for C54H91O37N7, 1430.30. Found, 143 0.71 

(M+H) + 



Synthesis of second level library with sialic acid as donor 
Preparation of compound 3 5 

A solution of compounds 29, 0.4 g, compound 26, 0.8 g and 
2 g of 3A molecular sieves in 15 mL of dry THF was stirred 
at room temperature for 10 min. under argon and cooled to 
-20°C. Two mL of 0 . 1 mol of TMS-OTF in dry THF was added 
in 5 min. The solution was stirred at -20 °C for 1 hour. 
The molecular sieves was filtered out and washed with ethyl 
acetate (5x10 mL) and the solvent was removed in vacuo. 
The residue was purified by silica gel column using 
hexane/ethyl acetate/methanol (5:10:3) to give 0.55 g of a 
white powder after freeze drying from dioxane. 



Preparation of compound 3 6 

A solution of compound 35, 0.2 g in 10 mL of 0.1 N NaOH and 
5 mL of methanol was stirred at room temperature for 24 
hours. IR120+ resin was added to adjust pH=4 . 5 and the 
resin was filtered out and washed with water (5x5 mL) . 
Water was removed in vacuo and the residue was purified by 
P-2 column to give a solid, 18.3 mg . 



ES-MS 
36-1 : 
36-2 : 
36-3 : 



36-5 : 
36-6 : 
36-7 : 
36-8 : 



36-11: 
36-12 : 



36-13 : 
36-14 : 



Calc. for C31H52O20N6, 828.32. Found, 829.60 (M+H) + 

Calc. for C39H65O23N7, 1031.40. Found, 1032.80 (M+H) + 
Calc. for C39Hg5025N7, 1031.40. Found, 1032.80 
(M+H) + 

Calc. for C5oHg2033Ns, 1322.50. Found, 662.00 
(M+2H) 2+ 

Calc. for C37H52O25N6, 990.37. Found, 991.70 (M+H) + 

Calc. for C45H75O30N7, 1193.45. Found, 1194.80 (M+H) + 

Calc. for C45H75O30N7, 1193.45. Found, 1194.80 (M+H) + 
Calc. for C62H,o2 0 43Ng, 
(M+2H) 2+ 

Calc. for CsgHgaOjgNg, 
(M+2H) 2 + 

C56Hc,2038N8, 1484.55. 



1646.60. Found, 824.20 
1484.55. Found, 743.60 
Found, 743.60 



Calc. for 
(M+2H) 2+ 

Calc. for C45H75O30N7, 1193.45. Found, 1194.80 (M+H) + 
Calc. for C45H75O30N7, 1193.45. Found, 1194.80 
(M+H) + 

Calc. for C51H85O35N7, 1356.22. Found, 1356.60 (M+H) + 



Calc. for C5iHg5 
(M+H) + 



1356.22. Found, 



Synthesis of second level library with GlcNAc and sialic 
acid as donors 

Preparation of compound 3 7 

A solution of compounds 29, 0.2 g, compound 26, 0.22 g and 
compound 2 0, 0.22g with 2 g of 3A molecular sieves in 15 mL 
of dry THF was stirred at room temperature for 10 min. 
under argon and cooled to -20°C. One mL of 0.1 mol of 
TMS-OTF in dry THF was added in 5 min. The solution was 
stirred at -2 0°C for 1 hour. The molecular sieves was 
filtered out and washed with ethyl acetate (5x10 mL) and 
the solvent was removed in vacuo. The residue was purified 
by silica gel column using hexane/ethyl acetate/methanol 
(5:10:1) to give 0.27 g of a white powder after freeze 
drying from dioxane . 



Preparation of compound 3 8 

A solution of compound 37, 0.27 g and 1 g of activated zinc 
in 2 0 mL of 80% acetic acid in ethyl acetate was stirred at 
room temperature for 2 hours. Zinc was filtered out and 
washed with ethyl acetate (5x20 mL) . The solvent was 
removed in vacuo and the residue was dissolved in 4 mL of 
dry pyridine and 2 mL of dry acetic anhydride. The 
solution was stirred at room temperature for 12 hours. The 
solvent was removed in vacuo and the residue was purified 
by silica gel column using hexane/ethyl acetate/methanol 
(5:10:1) to give 0.25 g of a white powder after freeze 
drying from dioxane . A solution of above solid, 0.25 g, in 
10 mL of 80% of aqueous acetic acid was stirred at 80°C for 
2 hours and the solvent was removed in vacuo. The residue 
was dissolved in 3 mL of methanol and 7 mL of 0.1 N NaOH 
was added and the mixture was stirred at room temperature 
for 24 hours. IR120+ resin was added to adjust pH=4 . 5 and 
the resin was filtered out and washed with water (5x5 mL) . 
Water was removed in vacuo and the residue was purified by 
P-2 column to give a solid, 16.1 mg . 

Example 6 : Screening of GSTA libraries 

An ELISA assay with antibodies that bind to a solid 
phase of either Ovine Submaxillary Mucin (OSM) or CA27.29 
were used in an inhibition format to screen the GSTA 
libraries, with library components being used as 
inhibitors. Wells were coated with 110 juL of antigen 
diluted in PBS and allowed to stand overnight at 4°C. 
One jag/mL OSM or 10 U/mL CA27.29 were used in the assay. 
On the day of the assay, the wells were aspirated and 
washed twice with PBS, then 200 /^L of 2% BSA blocking 
solution was added to each well. The wells were incubated 
2 hours at room temperature . 

The monoclonal antibodies and inhibitors were diluted 
in 1% FBS/PBS to various concentrations using glass culture 
tubes. Monoclonal antibody and inhibitor were combined in 
pre- incubation tubes. The zero inhibition tube received 



monoclonal antibody and 1% FBS/PBS diluent. The 
preincubation for each plate was completed exactly one hour 
prior to the end of the blocking incubation. When the 
blocking incubation was complete, the wells were aspirated 
and washed 4 times with TPBS (PBS + 0.05% Tween 20), and 
100 )LiL of the pre -incubation mixture was added to duplicate 
wells. Blank and substrate negative control wells received 
100 iiL, of 1% FBS/PBS diluent only. 

Wells were incubated for 90 minutes at room 
temperature. Just prior to the end of incubation, goat 
anti-mouse IgG, H+L, HRP labelled was diluted with 1% 
FBS/PBS. The wells were aspirated and washed 4 times with 
TPBS, and 90 juL of diluted goat ant i -mouse HRP was added to 
all wells except the substrate negative control wells, 
which received 90 /xL of 1% FBS/PBS. 

The wells were incubated for 90 minutes at room 
temperature, and then aspirated and washed 4 times with 
TPBS. Equal volumes of ABTS and peroxide solution B 
substrate were mixed, and 100 juL was added to each well. 
The plate was immediately put into a plate reader and read 
on kinetic mode at wavelength 405-490 nm, 10 minute read, 
20 second interval. Data were expressed in mOD/min, and 
results are shown in Table 3 and Figure 8 . 




HO 

a 
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27-1. R=Tni. Ri=H 
27-2. R=Ri=Tni 
27-3. R=TFi. Ri=H 
27-4. R=Ri=TFi 
27-5. R=Tni, Ri=TFi 
27-6. RaTFi.Ri=Tni 



28-1. R=Tni, Ri=:Ac 
28-2. R=R-,=Tni 
28-3.R=TFi. Ri=Ac 
28-4. R=Ri=TFi 
28-5. RssTni, Bi=TFi 
28-6. R=TF-i. Ri=Tni 



29-1. R=Tn2. Ri=Ac 
29-2. R=Ri=Tn2 
29-3. H=TF2. Hi=Ac 
29-4. R=nT=TF2 
29-5. R=Tn2, Rl=TF2 
29-6. R=TF2, Ri=Tn2 



30-1. R=Tn3. Ri=H 
M-2. R=R-i=Tr.3 
30-3. R=TF3. Ri=H 
3(M. R=Ri=TF3 
30-5. R=Tn3. Ri=TF3 
30-6.R=TF3. R1=Tn3 



Scheme iV. Rrst level of the library 
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9-1. R=Tn2, R-)=Ac 
23-2. H=R-)=Tn2 
23-3. R=TF2, Ri=Ac 
29-4. R=n^=TF2 
29-5. R=Tn2, Ri=TF2 
29-6. R=TF2, Ri=rn2 
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ACHN 
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1) 2inc 

2) AcgO/Py 
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31-1. R=Core6i, Fil=Ac 
31-2. RaTn2, Ri=»Core6i 
31-3. R=Core6i, R^sTna 
31-4. R=R^=Core6i 
31-5. H=Core2i,Ri=Ac 
31-6. R=Tn2, Ri=Core2^ 
31-7. R=Core2i, R-)=Tn2 
31-8. R=H^=Core2i 
31-9. R=Core2^, R^=Core6i 
31-10. R=Core6-(, R.j=Core2^ 
31-11. R=TF2, R.^=Core6i 
31-12. H=Core 6^ Rl=TF2_ 
31-13. H=TF2, Ri=Core2^ 
31-14. R=Cora 2^, Ri=TF2 



32-1. R=CQre62, Rl=H 
32-2. RaTns. Ri=Core62 
32-3. R=Core62. Ri=Tn3 
32-4. R5»R.|=Core62 
32-5. R=Core 22, Ri=H 
32-6. R=Tn3, Ri=Core22 
32-7. R=Core22, Ri=Tn3 
32-8. R=Rt=Core22 
32-9. R=Core22, Ri=Core62 
32-1 0. R=Core 62, R 1 =Core 22 
32-11. R=TF3, R.j=Core62 
32-12. R=Core 62. Ri=TF3 
32-13. R=TF3, R^=Core22 
32-14. RsCore 22, Ri=TF3 



Scheme V. Second levei of the library with GlcNAc as donor 
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BF3-OE12 
CH2CI2, r.t.. 



29-1. R=Tn2, Ri«Ac 
29-2. R=H^=Tn2 
23-3. R=TF2, Ri«Ac 
29-1. R=Ri=TF2 
29-5. R=Tn2, Ri»TF2 
29-6. R=TF2, Hi»Tn2 
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33-1. RsFIa^, Hi»Ac 
33-2. RaTn2, Ri-FIa-] 
33-3. R=F1o,, R^«Tn2 
33-4. R= R,«F1a<| 
33-5. R=T8tfa<j, flixsAc 
33-6. R=Tn2, Hi-Tetra^ 
33-7. R=Tetra^.R^=Tn2 
33-a. R=i R^aTWiai 
33-3. RsTetra^, R^aFla^ 
33-10. R=F1a^, R^=7etra1 
33-11. R=.TF2,Ri=F1a^ 
33-12. R»F1a^.R»TF2 
33-13. RWTetia^, R^=TF2 
33-14. R=:TF2t Ri«Tetra-, 



34.1.R=F1a2.Ri=H 
34-2. R=Tn3, Ri=Fl02 
34-3. R=Fta2. ^1=703 
34-4. R= R^-Fta2 
34-5. R=Tetra2. B^=H 
34-6. R=Tn3, Ri=Tetra2 
34-7. R=Tetra2, F'l=Tn3 
34-8. R= Hi«Tetfa2 
34-9. R=Tetra2. Ri=F1a2 
34-10. R=F1 02, Ri=Tetra2 
34-11. R=TF3,R^=F1a2 
34-12. R=F1 02. R=TF3 
34-13. R=Tetia2. Hi=TF3 
34-14. R=TF3. Ri=Telra2 



Scheme Vl.Second level of the library with N-acetyi lactosamine as donor 
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AcO OAC COO^ 

BzOJi i^A 

STni AcHN j 



AcHN-.Z-r-~-V p 
HO HO / 

ACHN 



STn2 

AcMN*X.-7~^ P 
AcO 

D?**^ HO 



AeO 
AeO 



HO 
STF2 




29-1 . R=Tn2. Ri=Ac 
23-2. R=ni=7n2 
29-3. R=TF2, Ri=Ac 
29-4. n=H-!=TF2 
29-5. R=Tn2, R>|«TF2 
29-6. H=T-2. Ri»7n2 



acmnZZ-t-SJ/ 





AcO 

26 



0P<0B9j 



35-1. R=STn^,R-i«Ac 
35-2. R=Tn2, H-j«STni 
35-3. R=STn-|, H^«Tn2 
35-4. H= Ri«STn^ 
3S.5.R=STFi.Hl«Ac 
35-6. R=Tn2, H-j-STF^ 
35-7. H=STFi. R.,=Tn2 
35-8.R=R.,»STF^ 
35-9. R=STF^. R,=STni 
35-10. R=STn^, R^=STFi 
35-11. R=TF2. Ri=STni 
35-12.R=STni, Ri=TF2 
35-13. R=TF2. Hi«STFi 
35-1 4. R=STFi. Ri=TF2 



36-1. R=STn2. Ri=H 
36-2. R=Tn3, R^=STn2 
36-3. R=STn2, Ri=Tn3 
36-4. R= Ri»STn2 
36-5. RsSTFj. Ri«H 
36-6. R=Tn3, R.|»STF2 
36-7. R=STF2. R^sTns 
38-8. R= R-|aSTF2 
36-9. R=STF2. R-(=STn2 
36-10. H=STn2. Ri=STF2 
36-11. R»TF3. Hi«STn2 
36-12.R=STn2. Ri=TF3 
38-13. RsTFg. R.,=STF2 
36-14. R=STF2. Ri=TP3 



Scheme VU.Second level of the library with sialic acid as donor 



FmocHN 




29-1. R=Tn2, R-t=Ac 
23-2. R=H-,=Tn2 
23-3. R=TF2, Ri=Ac 
29-4. R=Hi=7F2 
29-5. R=Tn2, Ri=TF2 
23-6. R=TF2, Ri=Tn2 



AeO TMS-OT? 

V ^ OAc CH3CN. -20«C 



TracHN I 



CO, 




37-1 . R=Core 61 , Ri =Ac 
37-2. R=Tn2, R-|=Core 6.| 
37-3. H=Core 6.j, R.|=Tn2 
37-4. R= Ri=Core 6-| 
37-5. RsCore 2-\ . Ri=Ac 
37-6. R=Tn2, Ri=Core 2^ 
37-7. RaCora 2-^, R-j=Tn2 
37-8. R=Ri=Core2i 
37-9. RsCore 2-( . Ri=Core6^ 
37-10. R=Core 6-] , Hi=CorB 2^ 
37-11. R=STn^, R-i=Ac 
37-12. R=Tn2, R'jsSTn-j 
37-13. R=STn.,, Ri=Tn2 
37-14. R= Ri=STn1 
37-15. H=TF2, Ri=Core 6^ 
37-16. RsCora 61, Ri=TF2 
37-17. R=TF2, Ri=Cor8 2^ 
37-1 a. R=Core 2^ , =TF2 



37-19. R=TF2, R-|=STni 
37-20. RsSTn^. Ri=TF2 
37-21. R=STF1,Ri=Ac 
37-22. R=Tn2, Ri=STF^ 
37-23. R=STF^, R.^=Tn2 
37-24. R=srrFi.Rl=TF2 
37-25. R=TF2.Ri=STFi 
37-26. R=Ri=STFi 
37-27. R=STFi, Ri=STn^ 
37-28. R=STni, R-j=STF^ 
37-29. R=STn.,, R.)=Core 6-\ 
37-30. R=Core 6 ^ , R ^ =STn 
37-31. R=STFi, Ri=Core6i 
37-32. R=Core6^, R^=STF^ 
37-33. RssSTn^, Rt=Core 2^ 
37-34. R=CorB2i. Ri=STn^ 
37-35. R=STFi. R^sCore 2i 
37-36. R=Core2i, Ri=STFi 



1) Zinc 

2) Ac20/Py 

3) 0.1 N NaOH 
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3«-1. R=Cor8 62. Hl=H 
38-2. R=Tn3. R-^=CorB 63 
38-3. R=Core 62. Ri=Tn3 
38-4. R=R-,=Core 62 
38-5. R=Core22, i=il=H 
38-6. R=Tn3, Ri=Core 22 
38-7. R=CorB Ri=Tn3 
38-8. R= R.]=Cor8 23 
38-9. R=Core 22. Ri=Cor8 62 
38-10. R=Cor8 62. Ri=Core22 
38-n. R=STn2. Rl=H 
38-12. R=Tn3, Ri=STn2 
38-13. R=STn2. Ri=Tn3 
38-14. R= R-j=STn2 
38-15. R=TF3. R^=Care 62 
38-16. R=Core 62. Ri=TF3 
38-17. R=TF3, R^=Core 22 
38-18. R=Core Ri=TF3 



38-19. fl=TF3, R^=STn2 
38-20. R=STn2. Ri=TF3 
38-21. R=STF2, Rl=H 
38-22. R=Tn3. Ri=STF2 
38.23. R=STF2. Ri=Tn3 
38-24. R=STF2. Ri=TF3 
38-25. R=TF3, R^=STF2 
38-26. R= R^=STF2 
38-27. R=STF2. Ri=STn2 
38-28. R=STn2, Ri=STF2 
38-29. R=STn2, R^=Core 62 
38-30. R=CorB 62. R^sSTng 
38-3 1 . R=STF2, R t =Core 62 
38-32. R=Cora 62, Ri=STF2 
38-33. R=STn2, R-i=CorB 22 
38-34. R=Core 22. Ri=STn2 
38-35. R=STF2. R^=Core 22 
38-36. R=Core 22- R-j=STF2 



Scheme VI 11. Second level of the library with GlcNAc 
and sialic acid as donors 
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We claim; 



1. A method of generating a glycopeptide library, 
comprising the steps of: 

(a) randomly glycosylating a platform having at least 
one glycosylation site with at least one glycosyl donor, 
optionally blocking unreacted glycosylation sites on the 
glycosylated platforms and optionally selectively removing 
one or more protecting groups on the carbohydrate groups 
introduced at the first level; whereby a first level 
library of glycosylated platforms is created; and then 

(b) optionally randomly glycosylating said first level 
library of glycosylated platforms, or a combination of 
first level libraries of glycosylated platforms, with at 
least one glycosyl donor, and optionally selectively 
removing one or more designated protecting groups on the 
carbohydrate groups introduced at the second level; whereby 
a second level library of glycosylated platforms is 
created . 

2. A method according to claim 1, which further 
comprises further randomly glycosylating said second level 
library of glycosylated platforms, or a combination of 
second level or first and second level libraries of 
glycosylated platforms, with at least one glycosyl donor, 
and optionally selectively removing one or more designated 
protecting groups on the carbohydrate groups introduced at 
the third level; whereby a third level library of 
glycosylated platforms is created; and optionally repeating 
the foregoing step to produce fourth and higher level 
libraries of increased diversity. 

3. A method according to claim 2, wherein said 
peptide has an amino acid sequence GVTSAPDTRPAPGSTA. 

4. A method according to claim 2, wherein said 
peptide has an amino acid sequence GSTA. 
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5. A method according to claim 2, wherein said 
unreacted glycosylation sites are blocked. 

6. A method according to claim 5, wherein said sited 
are blocked by acetylation. 

7. A method according to claim 3, wherein said 
glycosyl donors are selected from the group consisting of 
GalNAc, SGal (1-3 ) aGalNAc and sialyl. 

8. A method according to claim 4, wherein said 
glycosyl donors are selected from the group consisting of 
GalNAc, SGal (1-3) aGalNAc and sialyl. 

9. A method according to claim 1, wherein hydroxyl 
groups on said glycosyl donors are protected prior to 
reaction of said glycosyl donors with said platforms or 
said glycosylated platforms. 

10. A method according to claim 9, wherein said 
hydroxyl groups are deprotected after reaction with said 
platforms or said glycosylated platforms. 

11. A method according to claim 10, wherein some of 
said hydroxyl groups are removed during said deprotection 
step . 

12. A method according to claim 1, wherein said 
platform is a peptide. 

13. A method according to claim 1, wherein said 
platform does not contain peptide linkages. 



14. A method according to claim 1, wherein said 
platform comprises natural glycosylation sites. 



15. A method according to claim 1, wherein said 
platform comprises unnatural glycosylation sites. 

16. A method according to claim 1, wherein said 
platform comprises tandem repeats. 

17. A method according to claim 1, wherein each 
glycosylation site on said platform is unique and 
distinguishable from other sites due to distinct structural 
features in the vicinity of the site. 



18. A method according to claim 1, wherein said 
platform is a hybrid platform comprising a non-peptide 
polymer to which natural amino acid side chains with 
natural glycosylation sites are attached. 

19. A method according to claim 1, wherein said 
glycosylation sites provide hydroxy functions for 
0-glycosylation or carboxy or carboxamido functional groups 
for N-glycosylation. 

20. A method according to claim 1, wherein said 
glycosylation sites include one or more of serine, 
threonine, hydroxylysine and asparagine . 

21. A method according to claim 1, wherein said 
glycosylation sites consist entirely of d-optical 
configuration . 

22. A method according to claim 1, wherein said 
platform is constructed entirely of d-amino acids. 

23. A method according to claim 1, wherein said 
platform is linear. 



24. A method according to claim 1, wherein said 
platform is cyclic. 



25. A method according to claim 1, wherein said 
platform comprises a UV-active or fluorescent label. 

26. A method according to claim 1, wherein said 
platform comprises hydrophobic amino acids which increase 
the solubility of the platform in organic solvents. 

27. A method according to claim 1, wherein said 
glycosylation sites are spaced, singly or in clusters, 
between sequences that include hydrophobic amino acids. 

28. A method according to claim 1, wherein lipid 
chains are incorporated into said platform. 

29. A method according to claim 1, wherein said 
glycosyl donors are unnatural . 

30. A method according to claim 1, wherein said 
glycosyl donors comprise structures associated with 
adhesion ligands for bacterial receptors that are expressed 
on human cell surface antigens. 

31. A method according to claim 1, wherein said 
glycosyl donors comprise structures associated with 
malignant cell antigens. 

32. A randomly-generated glycopeptide library. 

33 . A randomly-generated glycopeptide library 
according to claim 32, comprising carcinoma-associated 
mucins . 

34. A library of glycosylated platforms produced by 
the method of claim 1 . 



35. A library of glycosylated platforms produced by 
the method of claim 2 . 



36. A library of glycosylated platforms produced by 
the method of claim 30. 

37. A library of glycosylated platforms produced by 
the method of claim 31. 

38. A method of identifying a biologically-active 
compound, comprising: 

generating a library of glycosylated platforms 
according to claim 34; and 

screening components of said library for drug-like, 
competitive inhibitory, immunostimulatory or ant ibody- like 
activity. 

39. A method of identifying an anti-viral compound, 
comprising : 

generating a library of glycosylated platforms 
according to claim 34; and 

screening components of said library for anti-viral 
activity . 

40. A method of identifying an anti-bacterial 
compound, comprising: 

generating a library of glycosylated platforms 
according to claim 30; and 

screening components of said library for the ability 
competitively to inhibit bacterial adhesion to a host cell. 

41. A method of identifying compounds for detection 
or treatment of cancer, comprising: 

generating a library of glycosylated platforms 
according to claim 31; and 

screening components of said library for anti-cancer 
activity . 



Abstract of the Disclosure 



Randomly generated glycopeptide combinatorial 
libraries are generated by randomly glycosylating a peptide 
having at least one glycosylation site with at least one 
glycosyl donor, optionally blocking unreacted glycosylation 
sites on the glycopeptides and optionally selectively 
removing one or more protecting groups on the carbohydrate 
groups introduced at the first level; whereby a first level 
library of glycopeptides is created; and then optionally 
randomly glycosylating said first level library of glyco- 
peptides, or a combination of first level libraries of 
glycopeptides, with at least one glycosyl donor, and 
optionally selectively removing one or more designated 
protecting groups on the carbohydrate groups introduced at 
the second level; whereby a second level library of glyco- 
peptides is created. Further iterations of the process 
result in higher level libraries of increased diversity. 
The glycopeptide libraries including, e.g., carcinoma- 
associated mucins such as MUCl, are screened for drug-like, 
competitive inhibitory, immunost imulatory , antibody-like , 
and other biological activities. 
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Combinatorial glycopeptides 



O1, O2, O3 = Giycosylation sites 

R-l to R5 = Side chains that create site specificity 



Figure 1 
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A CYCLIC IVIUC1 PEPTIDE 



Figure 2 




A SOLUBLE VERSION OF THE ABOVE (with C14 lipid) 
Figure 3 




Fig y re 4 




Figure 5 




Figure 6 



FIGURZ 8. 



Functional Demonstration of Glycopeptlde Library 
With Well Characterized Monoctonai Antibodies 
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